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AMPLIFICATIONS OF CHROMOSOMAL. REGION 20ql3 AS A 
5 PROGNOSTIC INDICATOR IN BREAST CANCER 

BACKGROUND OF THE INVENTION 
This invention pertains to the field to the field of cytogenetics. More 
particularly this invention pertains to the identification of a amplification at about 20ql3 
that is a good prognositc indicator of various cancers. In addition, this invention 
10 provides a number of probes specific for the 20ql3 amplicon. 

This invention was made with Government support under Grant Nos. CA- 
528207 and CA-44768 awarded by the National Institutes of Health and Contract Nos. 
DE-AC-03-76SF00098, W-7405-ENG-48 and W-7405-ENG-36 awarded by the U.S. 
Department of Energy. The Government has certain rights in this invention. 
15 Chromosome abnormalities are often associated with genetic disorders, 

degenerative diseases, and cancer. In particular, the deletion or multiplication of copies 
of whole chromosomes or chromosomal segments, and higher level amplifications of 
specific regions of the genome are common occurrences in cancer. See, for example 
Smith, et al.. Breast Cancer Res. Treat., 18: Suppl. 1: 5-14 (1991, van de Vijer & 
JO Nusse. Biochim. Biophys. Acta. 1072: 33-50 (1991), Sato, et al.. Cancer. Res., 50: 

7184-7189 (1990). In fact, the amplification and deletion of DNA sequences containing 
proto-oncogenes and tumor-suppressor genes, respectively, are frequently characteristic 
of tumorigenesis. Dutrillaux, et al.. Cancer Genet. Cytogenet., 49: 203-217 (1990). 
Clearly the identification of amplified and deleted regions and the cloning of the genes 
5 involved is crucial both to the study of tumorigenesis and to the development of cancer 
diagnostics. 

The detection of amplified or deleted chromosomal regions has 
traditionally been done by cytogenetics. Because of the complex packing of DNA into 
the chromosomes, resolution of cytogenetic techniques has been limited to regions larger 
3 than about 10 Mb; approximately the width of a band in Giemsa-siained chromosomes. 
In complex karyotypes with multiple translocations and other genetic changes, traditional 
cytogenetic analysis is of little utility because karyotype information is lacking or cannot 
be interpreted. Teyssier, J.R., Cancer Genet. Cytogenet., 37: 103 (1989). Furthermore 
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conventional cytogenetic banding analysis is time consuming, labor intensive, and 
frequently difficult or impossible. 

More recently, cloned probes have been used to assess the amount of a 
given DNA sequence in a chromosome by Southern blotting. This method is effective 
5 even if the genome is heavily rearranged so as to eliminate useful karyotype information. 
However, Southern blotting only gives a rough estimate of the copy number of a DNA 
sequence, and does not give any information about the localization of that sequence 
within the chromosome. 

Comparative genomic hybridization (CGH) is a more recent approach to 

10 identify the presence and localization of amplified/deleted sequences. See Kallioniemi, et 
al. Science, 258: 818 (1992). CGH, like Southern blotting, reveals amplifications and 
deletions irrespective of genome rearrangement. Additionally, CGH provides a more 
quantitative estimate of copy number than Souther blotting, and moreover also provides 
information of the localization of the amplified or deleted sequence in the normal 

15 chromosome. 

Using CGH, the chromosomal 20ql3 region has been identified as a region 
that is frequently amplified in cancers {see, e.g. copending application U.S.S.N. 
08/132,808, filed on October 6, 1993). Initial analysis of this region in breast cancer 
cell lines identified a region approximately 2 Mb on chromosome 20 that is consistently 
20 amplified. 

SUMMARY OF THE INVENTION 
The present invention relates to the identification of a narrow region (about 
600 kb) within a 2 Mb amplicon located at about chromosome 20ql3 (more precisely at 

25 20qI3.2) that is consistently amplified in primary tumors. In addition this invention 
provides a contig (a series of clones that contiguously spans this amplicon) as well as 
sequence information for a large number of exons and cDNAs located within the contig. 
The contig or components thereof can be used to prepare probes specific for the 
amplicon. The probes can be used to detect chromosomal abnormalities at 20ql3. 

30 Thus, in one embodiment, this invention provides a method of detecting a 

chromosome abnormality (e.g., an amplification or a deletion) at about position FLpter 
0.825 on human chromosome 20 (20ql3.2). The method involves contacting a 
chromosome sample from a patient with a composition consisting essentially of one or 
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more labeled nucleic aqid probes each of which binds selectively to a target 
polynucleotide sequence at about position FLpter 0.825 on human chromosome 20 under 
conditions in which the probe forms a stable hybridization complex with the target 
sequence; and detecting the hybridization complex. The step of detecting the 
5 hybridization complex can involve determining the copy number of the target sequence. 
The probe preferably comprises a nucleic acid that specifically hybridizes under stringent 
conditions to a nucleic acid selected from the nucleic acids listed in Table 1 or Table 2. 
Even more preferably, the probe is one or more nucleic acids selected from the nucleic 
acids listed in Table 1 or Table 2. The probe is preferably labeled, and is more 
10 preferably labeled with digoxigenin or biotin. In one embodiment, the hybridization 
complex is detected in interphase nuclei in the sample. Detection is preferably carried 
out by detecting a fluorescent label (e.g., FITC, fluorescein, or Texas Red). The 
method can further involve contacting the sample with a reference probe which binds 
selectively to a chromosome 20 centromere. 

In another embodiment, this invention provides for probes that specifically 
bind to the 20ql3 amplicon. Thus, this invention provides for a composition comprising 
a labeled nucleic acid probe which binds selectively to a target polynucleotide sequence 
at about FLpter 0.825 on human chromosome 20. The probes comprise one or more 
nucleic acids selected from the group consisting of the nucleic acids listed in Table 1 or - 
20 Table 2. In a preferred embodiment, the probes are labelled with digoxigenin or biotin. 

This invention also provides for kits for the detection of a chromosomal 
abnormality at about position FLpter 0.825 on human chromosome 20. The kits include 
a compartment which contains a labeled nucleic acid probe which binds selectively to a 
target polynucleotide sequence at about FLpter 0.825 on human chromosome 20. The 
probe preferably includes at least one nucleic acid that specifically hybridizes under 
stringent conditions to a nucleic acid selected the nucleic acids listed in Table 1 or Table 
2. Even more preferably, the probes comprise one or more nucleic acids selected from 
the nucleic acids listed in Table 1 or Table 2. In a preferred embodiment, the probes are 
labelled with digoxigenin or biotin. The kit may further include a reference probe 
30 specific to a sequence in the centromere of chromosome 20. 
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Definitions 



A "chromosome sample" as used herein refers to a tissue or cell sample 



prepared for standard in situ hybridization methods described below. The sample is 
prepared such that individual chromosomes remain substantially intact and typically 



techniques. 

"Nucleic acid M refers to a deoxyribonucleotide or ribonucleotide polymer 
in either single- or double-stranded form, and unless otherwise limited, would encompass 
known analogs of natural nucleotides that can function in a similar manner as naturally 
occurring nucleotides. 

"Subsequence 1 * refers to a sequence of nucleic acids that comprise a part of 
a longer sequence of nucleic acids. 

A "probe" or a "nucleic acid probe" , as used herein, is defined to be a 
collection of one or more nucleic acid fragments whose hybridization to a target can be 
detected. The probe is labeled as described below so that its binding to the target can be 
detected. The probe is produced from a source of nucleic acids from one or more 
particular (preselected) portions of the genome, for example one or more clones, an 
isolated whole chromosome or chromosome fragment, or a collection of polymerase 
chain reaction (PCR) amplification products. The probes of the present invention are 
produced from nucleic acids found in the 20ql3 amplicon as described herein. The 
probe may be processed in some manner, for example, by blocking or removal of 
repetitive nucleic acids or enrichment with unique nucleic acids. Thus the word "probe" 
may be used herein to refer not only to the detectable nucleic acids, but to the detectable 
nucleic acids in the form in which they are applied to the target, for example, with the 
blocking nucleic acids, etc. The blocking nucleic acid may also be referred to 
separately. What "probe" refers to specifically is clear from the context in which the 
word is used. 

"Hybridizing" refers the binding of two single stranded nucleic acids via 
complementary base pairing. 

"Bind(s) substantially" or "binds specifically" or "binds selectively" or 
"hybridizing specifically to" refers to complementary hybridization between an 
oligonucleotide and a target sequence and embraces minor mismatches that can be 
accommodated by reducing the stringency of the hybridization media to achieve the 



comprises metaphase spreads or interphase nuclei prepared according to standard 
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desired detection of the target polynucleotide sequence. These terms also refer to the 
binding, duplexing, or hybridizing of a molecule only to a particular nucleotide sequence 
under stringent conditions when that sequence is present in a complex mixture (e.g., total 
cellular) DNA or RNA. The term "stringent conditions" refers to conditions under 
5 which a probe will hybridize to its target subsequence, but to no other sequences. 
Stringent conditions are sequence-dependent and will be different in different 
circumstances. Longer sequences hybridize specifically at higher temperatures. 
Generally, stringent conditions are selected to be about 5°C lower than the thermal 
melting point (Tm) for the specific sequence at a defined ionic strength and pH. The Tm 
10 is the temperature (under defined ionic strength, pH, and nucleic acid concentration) at 
which 50% of the probes complementary to the target sequence hybridize to the target 
sequence at equilibrium. Typically, stringent conditions will be those in which the salt 
concentration is at least about 0.02 Na ion. concentration, (or other salts) at pH 7.0 to 8.3 
and the temperature is at least about 60°C for short probes. Stringent conditions may 
15 also be achieved with the addition of destabilizing agents such as formamide. 

One of skill will recognize that the precise sequence of the particular 
probes described herein can be modified to a certain degree to produce probes that are 
"substantially identical" to the disclosed probes, but retain the ability to bind substantially 
to the target sequences. Such modifications are specifically covered by reference to the 
20 individual probes herein. The term "substantial identity" of polynucleotide sequences 
means that a polynucleotide comprises a sequence that has at least 90% sequence 
identity, more preferably at least 95%, compared to a reference sequence using the 
methods described below using standard parameters. 

Two nucleic acid sequences are said to be "identical" if the sequence of 
25 nucleotides in the two sequences is the same when aligned for maximum correspondence 
as described below. The term "complementary to" is used herein to mean that the 
complementary sequence is identical to all or a portion of a reference polynucleotide 
sequence. 

-Sequence comparisons between two (or more) polynucleotides are typically 
30 performed by comparing sequences of the two sequences over a "comparison window" to 
identify and compare local regions of sequence similarity. A "comparison window", as 
used herein, refers to a segment of at least about 20 contiguous positions, usually about 
50 to about 200, more usually about 100 to about 150 in which a sequence may be 
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compared to a reference sequence of the same number of contiguous positions after the 

two sequences are optimally aligned. 

Optimal alignment of sequences for comparison may be conducted by the 

local homology algorithm of Smith and Waterman Adv. Appl. Math. 2: 482 (1981), by 
5 the homology alignment algorithm of Needleman and Wunsch J. MoL Biol. 48:443 

(1970), by the search for similarity method of Pearson and Lipman Proc. Natl. Acad. 

ScL (U.S.A.) 85: 2444 (1988), by computerized implementations of these algorithms. 

-Percentage of sequence identity" is determined by comparing two 

optimally aligned sequences over a comparison window, wherein the portion of the 
10 polynucleotide sequence in the comparison window may comprise additions or deletions 

(i.e., gaps) as compared to the reference sequence (which does not comprise additions or 

deletions) for optimal alignment of the two sequences. The percentage is calculated by 

determining the number of positions at which the identical nucleic acid base or amino 

acid residue occurs in both sequences to yield the number of matched positions, dividing 
15 the number of matched positions by the total number of positions in the window of 

comparison and multiplying the result by 100 to yield the percentage of sequence 

identity. 

Another indication that nucleotide sequences are substantially identical is if 
two molecules hybridize to the same sequence under stringent conditions. Stringent 
20 conditions are sequence dependent and will be different in different circumstances. 
Generally, stringent conditions are selected to be about 5°C lower than the thermal 
1 melting point (TJ for the specific sequence at a defined ionic strength and pH. The Tm 
is the temperature (under defined ionic strength and pH) at which 50% of the target 
sequence hybridizes to a perfectly matched probe. Typically, stringent conditions will be 
25 those in which the salt concentration is about 0.02 molar or lower at pH 7 and the 
temperature is at least about 60°C. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1(A) shows disease-free survival of 129 breast cancer patients 
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according to the level of 20ql3 amplification. Patients with tumors having high level 
20ql3 amplification have a shorter disease-free survival (p=0.04 by Mantel-Cox test) 
compared to those having no or low level amplification. 

Figure 1(B) Shows the same disease-free survival difference of Figure 
5 4(A) in the sub-group of 79 axillary node-negative patients (p=0.0022 by Mantel-Cox 
test). 

Figure 2 shows a comparison of 20ql3 amplification detected by FISH in a 
primary breast carcinoma and its metastasis from a 29-year patient. A low level 
amplification of 20ql3 (20ql3 compared to 20p reference probe) was found in the 
10 primary tumor. The metastasis, which appeared 8. months after mastectomy, shows a 
high level amplification of the chromosome 20ql3 region. The overall copy number of 
chromosome 20 (20p reference probe) remained unchanged. Each data point represents 
gene copy numbers in individual tumor cells analyzed. 

Figure 3 shows a graphical representation of the molecular cytogenetic 
15 mapping and subsequent cloning of the 20ql3.2 amplicon. Genetic distance is indicated 
in centiMorgans (cM). The thick black bar represents the region of highest level 
amplification in the breast cancer cell line BT474 and covers a region of about 1.5 Mb. 
PI and BAC clones are represented as short horizontal lines and YAC clones as heavier 
horizontal lines. Not all YAC and PI clones are shown. YACs 957f3, 782c9, 931hl2, 
20 and 90 lb 12 are truncated. Sequence tagged sites (STSs) appear as thin vertical lines and 
open circles indicate that a given YAC has been tested for and is positive for a given 
STS. Not all STSs have been tested on all YACs. The interval from which more than 
100 exons have been trapped is represented as a filled box. The 600 kb interval 
spanning the region of highest amplification level in primary tumors is represented by the 
25 filled black box (labeled Sequence). The lower part of the figure shows the levels of 
amplification in two primary tumors that further narrow the region of highest 
amplification to within about 600 kb. 

Figure 4 provides a higher resolution map of the amplicon core as defined 
in primary tumors. 



30 



DETAILED DESCRIPTION 

This invention provides a number of nucleic acid probes useful for the 
detection of chromosomal abnormalities at 20ql3. Studies using comparative genomic 
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hybridization (CGH) have shown that a region at chromosome 20ql3 is increased in copy 
number frequently in cancers of the breast (-30%), ovary (~ 15%) t bladder (-30%), 
head and neck (-75%) and colon ( ~ 80%), This suggests the presence of one or more 
genes that contribute to the progression of several solid tumors are located at 20ql3. 
5 Gene amplification is one mechanism by which dominantly acting 

oncogenes are overexpressed, allowing tumors to acquire novel growth characteristics 
and/or resistance to chemotherapeutic agents. Loci implicated in human breast cancer 
progression and amplified in 10-25% of primary breast carcinomas include the erbB-2 
locus (Lupu et aL, Breast Cancer Res. Treat., 27: 83 (1993), Slamon et at. Science, 

10 235: 177-182 (1987), Heiskanen et aL Biotechniques, 17: 928 (1994)) at 17ql2, cyclin-D 
(Mahadevan et aL. Science, 255: 1253-1255 (1993), Gillett et aL, Cane. Res., 54: 1812 
(1994)) at 1 lql3 and MYC (Gaffey et aL, Mod. Pathol., 6: 654 (1993)) at 8q34. 

Pangenomic surveys using comparative genomic hybridization (CGH) 
recently identified about 20 novel regions of increased copy number in breast cancer 

15 (Kallioniemi et aL, Genomics. 20: 125-128 (1994)). One of these loci, band 20ql3, was 
amplified in 18% of primary tumors and 40% of cell lines (Kallioniemi et aL, Genomics, 
20: 125-128 (1994)). More recently, this same region was found amplified in 15% of 
ovarian, 80% of bladder and 80% of colorectal tumors. The resolution of CGH is 
limited to 5-10 Mb. Thus, FISH was performed using locus specific probes to confirm 

20 the CGH data and precisely map the region of amplification. 

The 20ql3 region has been analyzed in breast cancer at the molecular level 
and a region, approximately 600 kb wide, that is consistently amplified was, identified, 
as described herein. Moreover, as shown herein, the importance of this amplification in 
breast cancer is indicated by the strong association between amplification and decreased 

25 patient survival and increased tumor proliferation (specifically, increased fraction of ceils 
in S-phase). 

In particular, as explained in detail in Example 1, high-level 20ql3 
amplification was associated (p=0.0022) with poor disease free survival in node-negative 
patients, compared to cases with no or low-level amplification (Figure 1). Survival of 
30 patients with moderately amplified tumors did not differ significantly from those without 
amplification. Without being bound to a particular theory, it is suggested that an 
explanation for this observation may be that low level amplification precedes high level 
amplification. In this regard, it may be significant that one patient developed a local 
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metastasis with high-level 20ql3.2 amplification 8 month after resection of a primary 
tumor with low level amplification (Figure 3). 

The 20ql3 amplification was associated with high histologic grade of the 
tumors. This correlation was seen in both moderately and highly amplified tumors. 
5 There was also a correlation (p=0.0085) between high level amplification of a region 
complementary to a particular probe, RMC20C001 (Tanner et al., Cancer Jtes., 54: 
4257-4260 (1994)), and cell proliferation, measured by the fraction of cells in S-phase 
(Figure 4). This finding is important because it identifies a phenotype that can be scored 
in functional assays, without knowing the mechanism underlying the increased S-phase 
10 fraction. The 20ql3 amplification did not correlate with the age of the patient, primary 
tumor size, axillary nodal or steroid hormone-receptor status. 

This work localized the 20ql3.2 amplicon to an interval of approximately 
2 Mb. Furthermore, it suggests that high-level amplification, found in 7% of the 
tumors, confers an aggressive phenotype on the tumor, adversely affecting clinical 
15 outcome. Low level amplification (22% of primary tumors) was associated with 

pathological features typical of aggressive tumors (high histologic grade, aneuploidy and 
cell proliferation) but not patient prognosis. 

In addition, it is shown herein that the 20ql3 amplicon (more precisely the 
20ql3.2 amplicon) is one of three separate co-amplified loci on human chromosome 20 
20 that are packaged together throughout the genomes of some primary tumors and breast 
cancer cell lines. No known oncogenes map in the 20ql3.2 amplicon. 

Identification of 20n13 A mblicon Prn h^, 

Initially, a paucity of available molecular cytogenetic probes dictated that 
25 FISH probes be generated by the random selection of cosmids from a chromosome 20 
specific library, LA20NC01, and map them to chromosome 20 by digital imaging 
microscopy. Approximately 46 cosmids, spanning the 70 Mb chromosome, were 
isolated for which fractional length measurements (FLpter) and band assignments were 
obtained. Twenty six of the cosmids were used to assay copy number in the breast 
30 cancer cell line BT474 by interphase FISH analysis. Copy number was determined by 
counting hybridization signals in interphase nuclei. This analysis revealed that cosmid 
RMC20C001 (Flpter, 0.824; 20ql3.2), described by Stokke et al.. Genomics, 26: 134- 
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137 (1995), defined the highest-level amplification ("60 copies/cell) in BT474 cells 
(Tanner et al, Cancer Res., 54: 4257-4260 (1994)). 

PI clones containing genetically mapped sequences were selected from 
20ql3.2 and used as FISH probes to confirm and further define the region of 
5 amplification. Other PI clones were selected for candidate oncogenes broadly localized 
to the 20ql3.2 region (Flpter, 0.81-0.84). These were selected from the DuPont PI 
library (Shepherd, et al. Proc. Natl. Acad. Sci. USA, 92: 2629 (1994), available 
commercially from Genome Systems), by PCR (Saiki et al, Science, 230: 1350 (1985)) 
using primer pairs developed in the 3' untranslated region of each candidate gene. Gene 

10 specific PI clones were obtained for, protein tyrosine phosphatase (PTPN1, Flpter 0.78), 
melanocortin 3 receptor (MC3R, Flpter 0.81), phosphoenolpymvate carboxy kinase 
(PCK1, Flpter 0.85), zinc finger protein 8 (ZNF8, Flpter 0.93), guanine 
nucleotide-binding protein (GNAS 1, Flpter .873), src-oncogene (SRC, Flpter 0.669), 
topoisomerase 1 (TOPI, Flpter 0.675), the bcl-2 related gene bcl-x (Flpter 0.526) and 

15 the transcription factor E2F-1 (FLpter 0.541). Each clone was mapped by digital 

imaging microscopy and assigned Flpter values. Five of these genes (SRC, TOPOl, 
GNAS 1 , E2F-1 and BCl-x) were excluded as candidate oncogenes in the amplicon 
because they mapped well outside the critical region at Flpter 0.81-0.84. Three genes 
(PTPNR1, PCK-1 and MC3R) localized close enough to the critical region to warrant 

20 further investigation. 

Interphase FISH on 14 breast cancer cell lines and 36 primary tumors 
using 24 cosmid and 3 gene specific PI (PTPNRL, PCK-1 and MC3R) probes found 
high level amplification in 35% (5/14) of breast cancer cell lines and 8% (3/36) of 
primary tumors with one or more probe. The region with the highest copy number in 

25 4/5 of the cell lines and 3/3 primary tumors was defined by the cosmid RMC20C001. 

This indicated that PTPNR1, PCK1 and MC3R could also be excluded as candidates for 
oncogenes in the amplicon and, moreover, narrowed the critical region from 10 Mb to 
1.5-2.0 Mb (see, Tanner etal.. Cancer Res., 54: 4257-4260 (1994). 

Because probe RMC20C001 detected high-level (3 to 10-fold) 20ql3.2 

30 amplification in 35% of cell lines and 8% of primary tumors it was used to (1) define the 
prevalence of amplification in an expanded tumor population, (2) assess the frequency 
and level of amplification in these tumors, (3) evaluate the association of the 20ql3.2 
amplicon with pathological and biological features, (4) determine if a relationship exists 
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between 20ql3 amplification and clinical outcome and (5) assess 20ql3 amplification in 
metastatic breast tumors. 

As detailed in Example l t fluorescent in situ hybridization (FISH) with 
RMC20C001 was used to assess 20ql3.2 amplification in 132 primary and II recurrent 
breast tumors. The absolute copy number (mean number of hybridization signals per 
cell) and the level of amplification (mean number of signals relative to the p-arm 
reference probe) were determined. Two types of amplification were found: Low level 
amplification (1.5-3 fold with FISH signals dispersed throughout the tumor nuclei) and 
high level amplification (>3 fold with tightly clustered FISH signals). Low level 
20ql3.2 amplification was found in 29 of the 132 primary tumors (22%), whereas nine 
cases (6.8%) showed high level amplification. 

RMC20C001 and four flanking PI probes (MC3R, PCK, RMC20CG26, 
and RMC20C030) were used to study the extent of DNA amplification in highly 
amplified tumors. Only RMC20C001 was consistently amplified in all tumors. This 
finding confirmed that the region of common amplification is within a 2 Mb interval 
flanked by but not including PCK-1 and MC3R. 

A physical map was assembled to further localize the minimum common 
region of amplification and to isolate the postulated pncogene(s). The DuPont PI library 
(Shepherd et al Proc. Natl. Acad. ScL USA, 91: 2629 (1994) was screened for STSs 
likely to map in band 20ql3.2. PI clones at the loci D20S102, D20S100, D20S120, 
D20S183, D20S480, D20S211 were isolated, and FISH localized each to 20ql3.2. 
Interphase FISH analysis was then performed in the breast cancer cell line BT474 to 
ms^ss the amplification level at each locus. The loci D20S100-D20S120-D20S183- 
D20S480-D20S211 were highly amplified in the BT474 cell line, whereas D20S102 
detected only low level amplification. Therefore, 5 STSs, spanning 5 cM, were localized 
within the 20ql3.2 amplicon and were utilized to screen the CEPH megaYAC library. 

CEPH megaYAC library screening and computer searches of public 
databases revealed D20S120-D20S183-D20S480-D20S211 to be linked on each of three 
megaYAC clones y820f5, 773hl0, and 931h6 (Figure 3). Moreover, screening the 
CEPH megaYAC library with STSs generated from the ends of cosmids RMC20C001, 
RMC20C3O and RMC20C028 localized RMC20C001 to each of the same three YAC 
clones. It was estimated, based on the size of the smallest of these YAC clones, that 
D20S120-D20S 183-RMC20CO01-D20S480-D20S2I 1 map into an interval of less than 
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1.1 Mb. D20S100 was localized 300 kb distal to D20S120 by interphase FISH and to 
YAC901bl2 by STS mapping. The combined STS data made it possible to construct a 
12 member YAC contig which spans roughly 4 Mb encompassing the' 1.5 Mb amplicon 
and containing the loci RMC20C030-PCK1-RMC20C001-MC3R-RMC20CO26. Each 
5 YAC was mapped by FISH to confirm localization to 20ql3.2 and to check for 
chimerism. Five clonal isolates of each YAC were sized by pulsed field gel 
electrophoresis (PFGE). None of the YACs are chimeric, however, several are highly 
unstable. 

The YAC contig served as a framework from which to construct a 2 MbPl 

10 contig spanning the 20Q13 amplicon. PI clones offered numerous advantages over YAC 
clones including (1) stability, (2) a chimeric frequency of less than 1 % t (3) DNA 
isolation by standard miniprep procedures, (4) they make ideal FISH probes, (5) the ends 
can be sequenced directly, (6) engineered yd transposons carrying bidirectional primer 
binding sites can be integrated at any position in the cloned DNA (Strathmann et al . 

15 Proc. NatL Acad. Sci. USA, 88: 1247 (1991)) (7) PI clones are the templates for 

sequencing the human and Drosophila genomes at the LBNL HGC (Palazzolo et aL DOE 
Human Genome Program, Contractor-Grantee Workshop IV. Santa Fe. New Mexico, 
November 13-17 1994). 

About 90 PI clones were isolated by screening the DuPont PI library 

20 either by PCR or filter hybridization. For PCR based screening, more than 22 novel 

STSs were created by two methods. In the first method, the ends of PI clones localized 
to the amplicon were sequenced, STSs developed, and the PI library screened for 
walking clones. In the second approach inter- Alu PCR (Nelson et al. , 86: 6686-6690 
(1989)) was performed on YACs spanning the amplicon and the products cloned and 

25 sequenced for STS creation. In the filter based hybridization scheme PI clones were 
obtained by performing inter-Alu PCR on YACs spanning the amplicon, radio-labeling 
the products and hybridizing these against filters containing a gridded array of the PI 
library. Finally, to close gaps a human genomic bacterial artificial chromosome (BAC) 
library (Shizuya et aL Proc. NatL Acad. Sci. USA, 89: 8794 (1992), commercially 

30 available from Research Genetics, Huntsville, Alabama, USA) was screened by PCR. 

These methods combined to produce more than 100 PI and BAC clones were localized to 
20ql3.2 by FISH. STS content mapping, fingerprinting, and free-chromatin fish 
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- (Heiskanen et al., BioTechniques, 17: 928 (1994)) were used to construct the 2 Mb 
contig shown in Figure 3. 

Fine Mapping the 2n q i 3.2 Amnlicon in BT474 
5 Clones from the 2 Mb PI contig were used with FISH to map the level of 

amplification at 20ql3,2 in the breast cancer cell line BT474. 35 PI probes distributed 
at regular intervals along the contig were used. The resulting data indicated that the 
region of highest copy number increase in BT474 occurs between D20S100 and 
D20S211, an interval of approximately 1.5 Mb. PI FISH probes, in this interval, detect 
10 an average of 50 signals per interphase nuclei in BT474, while no, or only low level 
amplification, was detected with the PI clones outside this region. Thus, both the 
proximal and distal boundaries of the amplicon were cloned. 

Fine Mapping the 20q l3.2 Amnlicon in Primary Tumors . 
15 Fine mapping the amplicon in primary tumors revealed the minimum 

common region of high amplification that is of pathobiological significance. This process 
is analogous to screening for informative meiosis in the narrowing of genetic intervals 
encoding heritable disease genes. Analysis of 132 primary tumors revealed thirty-eight 
primary tumors that are amplified at the RMC20C001 locus. Nine of these tumors have 
20 high level amplification at the RMC20C001 locus and were further analyzed by 

interphase FISH with 8 Pis that span the «2 Mb contig. The minimum common region 
of amplification was mapped to a «600 kb interval flanked by PI clones #3 and #12 
with the highest level of amplification detected by PI clone #38 corresponding to 
RMC20C001 (Figure 4). 
25 The PI clones spanning the 600 kb interval of the 20ql3 amplicon are 

listed in Table 1 which provides a cross-reference to the DuPont PI library described by 
Shepherd, et al.. Proc. Natl. Acad. Sci. USA, 92: 2629 (1994). These PI probes are 
available commercially from Genetic Systems. 
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The BAC clones of the contig spannng the 20ql3 amplicon are listed in 
Table 2 which provides a cross-reference to the DuPont PI library described by 
Shepherd, et aL.Proc. Natl. Acad. ScL USA, 92: 2629 (1994). These probes are 
5 available commercially from Research Genetics. 

In addition, Tables 1 and 2 provide PCR primers for amplifying sequence 
tagged sites (STSs) for each of the PI and BAC probes. One of skill in the art would 
appreciate that using plate coordinates provided above and/or the primers 

io Exptj Trapping 

Exon trapping (see, e.g.. Duyk et qL 9 Proc. Natl. Acad. ScL USA, 87: 
8995-8999 (1990) and Church et al.. Nature Genetics, 6: 98-105 (1994)) was performed 
on the PI and BAC clones spanning the ~600 kb minimum common region of 
amplification and has isolated more than 200 exons. 
15 Analysis of the exons DNA sequence revealed a number of sequence 

similarities (85% to 96%) to partial CDNA sequences in the expressed sequence data 
base (dbest) and to a S. cerevisiae chromosome XIV open reading frame. Each Pi clone 
subjected to exon trapping has produced multiple exons consistent with at least a medium 
density of genes. 

20 

Detection of 20ql3 Abnormalities. 

One of skill in the art will appreciate that the clones and sequence 
information provided herein can be used to detect amplifications, or other chromosomal 
abnormalities, at 20ql3 in a biological sample. Generally the methods involve 
25 hybridization of probes that specifically bind one or more nucleic acid sequences of the 
target amplicon with nucleic acids present in a biological sample or derived from a 
biological sample. 

As used herein, a biological sample is a sample of biological tissue or fluid 
contains cells in it is desired to screen for chromosomal abnormalities (e.g. 
30 amplifications of deletions). In a preferred embodiment, the biological sample is a cell 
or tissue suspected of being cancerous (transformed). Methods of isolating biological 
samples are well known to those of skill in the art and include, but are not limited to, 
aspirations, tissue sections, needle biopsies, and the like. Frequently the sample will be 
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a "clinical sample" which is a sample derived from a patient. It will be recognized that 
the term "sample" also includes supernatant (containing cells) or the cells themselves 
from cell cultures, cells from tissue culture and other media in which it may be desirable 
to detect chromosomal abnormalities. 

In a preferred embodiment, a biological sample is prepared by depositing 
cells, either as single cell suspensions or as tissue preparation, on solid supports such as 
glass slides and fixed by choosing a fixative which provides the best spatial resolution of 
the cells and the optimal hybridization efficiency. 

Selecting Probey 

Any of the PI probes listed in Table 1 or the BAC probes listed in Table 2 
are suitable for use in detecting the 20ql3 amplicon* Methods of preparing probes are 
well known to those of skill in the art {see, e.g. Sambrook et al % Molecular Cloning: A, 
Laboratory Manual (2nd ed.) % Vols. 1-3, Cold Spring Harbor Laboratory, (1989) or 
Current Protocols in Molecular Biology, F. Ausubel et al , ed. Greene Publishing and 
Wileyrlnterscience, New York (1987)) 

The probes are most easily prepared by combining and labeling one or 
more of the constructs listed in Tables 1 and 2. Prior to use, the constructs are 
fragmented to provide smaller nucleic acid fragments that easily penetrate the cell and 
hybridize to the target nucleic acid. Fragmentation can be by any of a number of 
methods well known to hose of skill in the an. Preferred methods include treatment with 
a restriction enzyme to selectively cleave the molecules, or alternatively to briefly heat 
the nucleic acids in the presence of Mg } +. Probes are preferably fragmented to an 
average fragment length ranging from about 50 bp to about 2000 bp, more preferably 
from about 100 bp to about 1000 bp and most preferably from about 150 bp to about 500 
bp. 

Alternatively, probes can be produced by amplifying (e.g. via PCR) 
selected subsequences from the 20ql3 amplicon disclosed herein. The sequences 
provided herein permit one of skill to select primers that amplify sequences from one or 
more exons located within the 20ql3 amplicon. 

Particularly preferred probes include nucleic acids from probes 38, 40, and 
79, which corresponds to RMC20C00L 
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One of skill will appreciate that using the sequence information and clones 
provided herein, one of skill in the art can isolate the same or similar probes from other 
human genomic libraries using routine methods (e.g. Southern or Northern Blots). 

S Milling Prpfrre 

Methods of labeling nucleic acids are well known to those of skill in the 
art. Preferred labels are those that are suitable for use in in situ hybridization. The 
nucleic acid probes may be detectably labeled prior to the hybridization reaction. 
Alternatively, a detectable label which binds to the hybridization product may be used. 
10 Such detectable labels include any material having a detectable physical or chemical 
property and have been well-developed in the field of immunoassays. 

As used herein, a "label" is any composition detectable by spectroscopic, 
photochemical, biochemical, immunochemical, or chemical means. Useful labels in the 
present invention include radioactive labels (e.g. 32 P, m I, M C, 3 H, and 3S S), fluorescent 
15 dyes (e.g. fluorescein, rhodamine, Texas Red, etc.), electron-dense reagents (e.g. gold), 
enzymes (as commonly used in an ELISA), colorimetric labels (e.g. colloidal gold), 
magnetic labels (e.g. Dynabeads™), and the like. Examples of labels which are not 
directly detected but are detected through the use of directly detectable label include 
biotin and dioxigenin as well as haptens and proteins for which labeled antisera or 
20 monoclonal antibodies are available. 

The particular label used is not critical to the present invention, so long as 
it does not interfere with the in situ hybridization of the stain. However, stains directly 
labeled with fluorescent labels (e.g. fluorescein- 12-dUTP, Texas Red-5-dUTP, etc.) are 
preferred for chromosome hybridization. 
25 A direct labeled probe, as used herein, is a probe to which a detectable 

label is attached. Because the direct label is already attached to the probe, no subsequent 
steps are required to associate the probe with the detectable label. In contrast, an 
indirect labeled probe is one which bears a moiety to which a detectable label is 
subsequently bound, typically after the probe is hybridized with the target nucleic acid. 
30 In addition the label must be detectible in as low copy number as possible 

thereby maximizing the sensitivity of the assay and yet be detectible above any 
background signal. Finally, a label must be chosen that provides a highly localized 
signal thereby providing a high degree of spatial resolution when physically mapping the 
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stain against the chromosome. Particularly preferred fluorescent labels include 
fluorescein- 12-dUTP and Texas Red-5-dUTP. 

The labels may be coupled to the probes in a variety of means known to 
those of skill in the an. In a preferred embodiment the nucleic acid probes will be 
labeled using nick translation or random primer extension (Rigby, et al J. MoL BioL , 
1 13; 237 (1977) or Sambrook, et aL. Molecular Cloning - A Laboratory Manual, Cold 
Spring Harbor Laboratory, Cold Spring Harbor, N.Y. (1985)). 

One of skill in the art will appreciate that the probes of this invention need 
not be absolutely specific for the targeted 20qI3 region of the genome. Rather, the 
probes are intended to produce "staining contrast % -Contrast" is quantified by the ratio 
of the probe intensity of the target region of the genome to that of the other portions of 
the genome. For example, a DNA library produced by cloning a particular chromosome 
(e.g. chromosome 7) can be used as a stain capable of staining the entire chromosome. 
The library contains both sequences found only on that chromosome, and sequences 
15 shared with other chromosomes. Roughly half the chromosomal DNA falls into each 

class. If hybridization of the whole library were capable of saturating all of the binding 
sites on the target chromosome, the target chromosome would be twice as bright 
(contrast ratio of 2) as the other chromosomes since it would contain signal from the both 
the specific and the shared sequences in the stain, whereas the other chromosomes would 
20 only be stained by the shared sequences. Thus, only a modest decrease in hybridization 
of the shared sequences in the stain would substantially enhance the contrast. Thus 
contaminating sequences which only hybridize to non-targeted sequences, for example, 
impurities in a library, can be tolerated in the stain to the extent that the sequences do 
not reduce the staining contrast below useful levels. 

25 

Detecting the 20ol3 Ampliron 

As explained above, detection of amplification in the 20ql3 amplicon is 
indicative of the presence and/or prognosis of a large number of cancers. These include, 
but are not limited to breast, ovary, bladder, head and neck, and colon. 
30 In a preferred embodiment, a 20ql3 amplification is detected through the 

hybridization of a probe of this invention to a target nucleic acid (e.g. a chromosomal 
sample) in which it is desired to screen for the amplification. Suitable hybridization 
formats are well known to those of skill in the art and include, but are not limited to, 
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variations of Southern Blots, in situ hybridization and quantitative amplification methods 
such as quantitative PCR (see, e.g. Sambrook, supra., Kallioniemi et al, Proc. Natl 
Acad Sci USA, 89: 5321-5325 (1992), and PCR Protocols^ A Guide to Methods and 
Applications, Innis et ai, Academic Press, Inc. N.Y., (1990)). 

5 

In situ Hybridization. 

In a preferred embodiment, the 20ql3 amplicon is identified using in situ 
hybridization. Generally, in situ hybridization comprises the following major steps: (1) 
fixation of tissue or biological structure to analyzed; (2) prehybridization treatment of the 

10 biological structure to increase accessibility of target DNA, and to reduce nonspecific 
binding; (3) hybridization of the mixture of nucleic acids to the nucleic acid in the 
biological structure or tissue; (4) posthybridization washes to remove nucleic acid 
fragments not bound in the hybridization and (5) detection of the hybridized nucleic acid 
fragments. The reagent used in each of these steps and their conditions for use vary 

15 depending on the particular application. 

In some applications it is necessary to block the hybridization capacity of 
repetitive sequences. In this case, human genomic DNA is used as an agent to block 
such hybridization. The preferred size range is from about 200 bp to about 1000 bases, 
more preferably between about 400 to about 800 bp for double stranded, nick translated 

20 nucleic acids. 

Hybridization protocols for the particular applications disclosed here are 
described in Pinkel et ai Proc. Natl. Acad. ScL USA, 85: 9138-9142 (1988) and in EPO 
Pub. No. 430,402. Suitable hybridization protocols can also be found in Methods o\in 
Molecular Biology Vol 33: In Situ Hybridization Protocols* K.H.A. Choo, ed., Humana 
25 Press, Totowa, New Jersey, (1994). In a particularly preferred embodiment, the 

hybridization protocol of Kallioniemi et al., Proc. Natl Acad Sci USA, 89: 5321-5325 
(1992) is used. 

Typically, it is desirable to use dual color FISH, in which two probes are 
utilized, each labelled by a different fluorescent dye. A test probe that hybridizes to the 
30 region of interest is labelled with one dye, and a control probe that hybridizes to a 

different region is labelled with a second dye. A nucleic acid that hybridizes to a stable 
portion of the chromosome of interest, such as the centromere region, is often most 
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useful as the control probe. In this way, differences between efficiency of hybridization 
from sample to sample can be accounted for. 

The FISH methods for detecting chromosomal abnormalities can be 
performed on nanogram quantities of the subject nucleic acids. Paraffin embedded tumor 
5 sections can be used, as can fresh or frozen material. Because FISH can be applied to 
the limited material, touch preparations prepared from uncultured primary tumors can 
also be used (see. e.g.. Kailioniemi, A. etal., Cytogenet. Cell Genet. 60: 190-193 
(1992)). For instance, small biopsy tissue samples from tumors can be used for touch 
preparations (see. e.g., Kailioniemi, A. etal., Cytogenet. Cell Genet. 60: 190-193 
10 (1992)). Small numbers of cells obtained from aspiration biopsy or cells in bodily fluids 
(e.g. , blood, urine, sputum and the like) can also be analyzed. For prenatal diagnosis, 
appropriate samples will include amniotic fluid and the like. 

Southern Bln^ 

15 In a Southern Blot, a genomic or cDNA (typically fragmented and 

separated on an electrophoretic gel) is hybridized to a probe specific for the target 
region. Comparison of the intensity of the hybridization signal from the probe for the 
target region (e.g. , 20ql3) with the signal from a probe directed to a control (non 
amplified) such as centromeric DNA, provides an estimate of the relative copy number 

20 of the target nucleic acid. 

Kits Containing 20ol3 A mnlicon Prft t yp ff, 

This invention also provides diagnostic kits for the detection of 
chromosomal abnormalities at 20ql3. In a preferred embodiment, the kits include one or 

25 more probes to the 20ql3 amplicon described herein. The kits can additionally include 
blocking probes, instructional materials describing how to use the kit contents in 
detecting 20ql3 amplicons. The kits may also include one or more of the following: 
various labels or labeling agents to facilitate the detection of the probes, reagents for the 
hybridization including buffers, a metaphase spread, bovine serum albumin (BSA) and 

10 other blocking agents, sampling devices including fine needles, swabs, aspirators and the 
like, positive and negative hybridization controls and so forth. 

EXAMPLES 
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The following examples are offered to illustrate, but not to limit the 
present invention. 

Example 1 

PROGNOSTIC IMPLICATIONS OF AMPLIFICATION OF fH ROMOSOMAl. 
5 REGION 20ql3 IN BREAST CANCER 

Patients and tumor material 

Tumor samples were obtained from 152 women who underwent surgery 
for breast cancer between 1987 and 1992 at the Tampere University or City Hospitals, 
One hundred and forty-two samples were from primary breast carcinomas and 1 1 from 

10 metastatic tumors. Specimens from both the primary tumor and a local metastasis were 
available from one patient. Ten of the primary tumors that were either in situ or 
mucinous carcinomas were excluded from the material, since the specimens were 
considered inadequate for FISH studies. Of the remaining 132 primary tumors, 128 
were invasion ductal and 4 lobular carcinomas. The age of the patients ranged from 29 

15 to 92 years (mean 61). Clinical follow-up was available from 129 patients. Median 

follow-up period was 45 months (range 1.4-1.77 months). Radiation therapy was given 
to 77 of the 129 patients (51 patients with positive and 26 with negative lymph nodes), 
and systemic adjuvant therapy to 36 patients (33 with endocrine and 3 with cytotoxic 
chemotherapy). Primary tumor size and axillary node involvement were determined 

20 according to the tumor-node metastasis (TNM) classification. The histopathological 
diagnosis was evaluated according to the World Health Organization (11). The 
carcinomas were graded on the basis of the tubular arrangement of cancer cells, nuclear 
atypia, and frequency of mitotic or hyperchromatic nuclear figures according to Bloom 
and Richardson, Br. 7. Cancer, 11: 359-377 (1957). 

25 Surgical biopsy specimens were frozen at -70°C within 15 minutes of 

removal. Cryostat sections (5-6 /im) were prepared for intraoperative histopathological 
diagnosis, and additional thin sections were cut for immunohistochemical studies. One 
adjacent 200 thick section was cut for DNA flow cytometric and FISH studies. 

30 Cell preparation for FISH . 

After histological verification that the biopsy specimens contained a high 
proportion of tumor cells, nuclei were isolated from 200 fim frozen sections according to 
a modified Vindelov procedure for DNA flow cytometry, fixed and dropped on slides for 
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FISH analysis as described by Hyytinen et aL. Cytometry 16: 93-99 (1994). Foreskin 
fibroblasts were used as negative controls in amplification studies and were prepared by 
harvesting cells at confluency to obtain Gl phase enriched interphase nuclei. All samples 
were fixed in methanol -acetic-acid (3:1). 

5 

Pfpbes. 

Five probes mapping to the 20ql3 region were used (see Stokke, et aL. 
Genomics, 26: 134-137 (1995)). The probes included Pl-clones for melanocortin-3- 
receptor (probe MC3R, fractional length from p-arm telomere (Flpter 0.81) and 

10 phosphoenolpyruvate carboxy kinase (PCK. Flpter 0.84), as well as anonymous cosmid 
clones RMC20C026 (Flpter 0.79). In addition, RMC20C001 (Flpter 0.825) and 
RMC20C030 (Flpter 0.85) were used. Probe RMC20C001 was previously shown to 
define the region of maximum amplification (Tanner et aL. Cancer Res, 54: 4257-4260 
(1994)). One cosmid probe mapping to the proximal p-arm, RMC20C038 (FLpter 

15 0.237) was used as a chromosome-specific reference probe. Test probes were labeled 
with biotin- 1 4-dATP and the reference probe with digoxigenin- 1 1 -dUTP using nick 
translation (Kallioniemi et aL. Proc. Natl Acad Sci USA. 89: 5321-5325 (1992)). 

Fluorescenc e in situ hybridization 
20 Two-color FISH was performed using biotin-labeled 20ql3-specific probes 

and digoxigenin-iabelled 20p reference probe essentially as described (Id.). Tumor 
samples were postfixed in 4% paraformaldtheyde/phosphate-buffered saline for 5 min at 
4 C prior to hybridization, dehydrated in 70%, 85% and 100% ethanol, air dried, and 
incubated for 30 min at 80°C. Slides Were denatured in a 70% formamide/2x standard 
25 saline citrate solution at 72-74°C for 3 min, followed by a proteinase K digestion (0.5 

Mg/ml). The hybridization mixture contained 18 ng of each of the labeled probes and 10 
fig human placental DNA. After hybridization, the probes were detected 
immunochemically with avidin-FITC and anti-digoxigenin Rhodamine. Slides were 
counterstained with 0.2 M M 4,6-diamidino-2-phenylindole (DAPI) in an anufade solution. 



30 



Fluorescence microscopy and s coring nf signals in interphase nuclei 

A Nikon fluorescence microscope equipped with double band-bass filters 
(Chromatechnology, Brattleboro, Vermont, USA) and 63 x objective (NA 1.3) was used 
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for simultaneous visualization of FITC and Rhodamine signals. At least 50 non- 
overlapping nuclei with intact morphology based on the DAPI counterstaining were 
scored to determine the number of test and reference probe hybridization signals. 
Leukocytes infiltrating the tumor were excluded from analysis. Control hybridizations to 
5 normal fibroblast interphase nuclei were done to ascertain that the probes recognized a 
single copy target and that the hybridization efficiencies of the test and reference probes 
were similar. 

The scoring results were expressed both as the mean number of 
hybridization signals per cell and as mean level of amplification (- mean of number of 
10 signals relative to the number of reference probe signals). 

PNA flow cytometry and steroid receptor analyses. 

DNA flow cytometry was performed from frozen 200 /im sections as 
described by Kallioniemi, Cytometry 9: 164-169 (1988). Analysis was carried out using 

15 an EPICS C flow cytometer (Coulter Electronics Inc., Hialeah, Forida, USA) and the 

MukiCycle program (Phoenix Flow Systems, San Diego, California, USA). DNA-index 
over 1.07 (in over 20% of cells) was used as a criterion for DNA aneuploidy. In DNA 
aneuploid histograms, the S -phase was analyzed only from the aneuploid clone. Cell 
cycle evaluation was successful in 86% (108/126) of the tumors. 

20 Estrogen (ER) and progesterone (PR) receptors were detected 

immunohistochemically. from cryostat sections as previously described (17). The 
staining results were semiquantitatively evaluated and a histoscore greater than or equal 
to 100 was considered positive for both ER and PR (17). 

25 Statistical Methods . 

Contingency tables were analyzed with Chi square test for trend. 
Association between S-phase fraction (continuous variable) and 20ql3 amplification was 
analyzed with Kruskal-WaJlis test. Analysis of disease-free survival was performed using 
the BMDPIL program and Mautel-Cox test and Cox's proportional hazards model 

30 (BMDP2L program) was used in multivariate regression analysis (Dixon BMDP 

Statistical Software. London, Berkeley, Los Angeles: University of California Press, 
(1981)). 
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Amplification of 20ql3 in primary breast carcinomas bv fl uorescence in situ 
hybridization . 

The minimal region probe RMC20C001 was used in FISH analysis to 
zssess the 20ql3 amplification. FISH was used to analyze both the total number of 
5 signals in individual tumor cells and to determine the mean level of amplification (mean 
copy number with the RMC20C001 probe relative to a 20p-reference probe). In 
addition, the distribution of the number of signals in the tumor nuclei was also assessed. 
Tumors were classified into three categories: no. low and high level of amplification. 
Tumors classified as not amplified showed less than 1.5 than 1.5 fold-copy number of 
10 the RMC20C001 as compared to the p-arm control. Those classified as having low-level 
amplification had 1.5-3-fold average level of amplification. Tumors showing over 3-fold 
average level of amplification were classified as highly amplified. 

The highly amplified tumors often showed extensive intratumor 
15 heterogeneity with up to 40 signals in individual tumor cells. In highly amplified 

tumors, the RMC20C001 probe signals were always arranged in clusters by FISH, which 
indicates location of the amplified DNA sequences in close proximity to one another e.g. 
in a tandem array. Low level 20ql3 amplification was found in 29 of the 132 primary 
tumors (22%), whereas nine cases (6.8%) showed high level amplification. The overall 
20 prevalence of increased copy number in 20ql3 was thus 29% (38/132). 

Defining the minimal region of amplification. 

The average copy number of four probes flanking RMC20C001 was 
determined in the nine highly amplified tumors. The flanking probes tested were 
25 malanocortin-3-receptor (MC3R, FLpter 0.81), phosphoenolpyruvate carboxykinase 
(PCK, 0.84), RMC20C026 (0.79) and RMC20C030 (0.85). The amplicon size and 
location varied slightly from one tumor to another but RMC20C001 was the only probe 
consistently highly amplified in all nine cases. 

*0 Associati on of 20q13 amplification with pathological and biological features. 

The 20qI3 amplification was significantly associated with high histologic 
grade of the tumors (p=0.01); This correlation was seen both in moderately and highly 
amplified tumors (Table 1). Amplification of 20ql3 was also significantly associated 
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with aneuploidy as determined by DNA flow cytometry (p=0.01. Table 1.) The mean 
cell proliferation activity, measured as the percentage of cells in the S-phase fraction, 
increased (p =0.0085 by Kruskal-Wallis test) with the level of amplification in tumors 
with no, low and high levels of amplification (Table I). No association was found with 
5 the age of the patient, primary tumor size, axillary nodal or steroid hormone-receptor 
status (Table 1). 

Table 1. Clinicopathological correlations of amplification at chromosomal region 20ql3 
in 1 32 primary breast cancers. ssa _ M _ ===== _ = _____-^^ 

10 Pathobiologic 20gl3 amplification status p-value 1 

feature 



MO LOW LEVEL HIGH LEVEL 

Number of Number of Number of 

patients (%) patients (%) patients (%) 





All primary 
tumors 


94 


(71%) 


29 


(22%) 


9 


(6.8%) 




15 


Age of patients 
< 50 years 
> 50 years 


17 
77 


(65%) 
(73%) 


6 

23 


(23%) 
(22%) 


3 
6 


(12%) 
(5.7%) 


.39 


20 


Tumor size 
< 2 cm 
> 2 cm 


33 
58 


(79%) 
(67%) 


7 

22 


(17%) 
(25%) 


2 
7 


(4.8%) 
(8.0%) 


• 16 




Nodal status 
Negative 
Positive 


49 
41 


(67%) 
(75%) 


19 
10 


(26%) 
(18%) 


5 
4 


(6.8%) 
(7.3%) 


.41 


25 


Histologic grade 
I - II 
III 


72 
16 


(76%) 
(52%) 


IB 
11 


(19%) 
(35%) 


5 
4 


(5.3%) 
(13%) 


.01 


30 


Estrogen 
receptor status 

Negative 

Positive 


30 
59 


(67%) 
(72%) 


10 
19 


(22%) 
(23%) 


5 
4 


(11%) 
(4.9%) 


.42 




Progesterone 
receptor status 

Negative 

Positive 


57 
32 


(69%) 
(74%) 


20 
8 


(24%) 
(19%) 


6 
3 


(7.2%) 
(7.0%) 


.53 


35 


DNA ploidy 
Diploid 
Aneuploid 

S-phase fraction 
<%) 


45 (82%) 
44 (62%) 

mean ± SD 


8 (14.5%) 
20 (28%) 

mean ± SD 


2 (3.6%) 
7 (10%) 

mean ± SD 


.01 
.0085 1 




9.9 


J. 7,2 


12. 


6 l 6.7 


19, 


0 t 10.5 





40 1 KruBltal-Wallis Test. 

Relationship between 20ol3 amplification a nd disease-free survival. 

Disease-free survival of patients with high-level 20ql3 amplification was 

significantly shorter than for patients with no or only low-level amplification (p-0.04). 
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Disease-free survival of patients with moderately amplified tumors did not differ 
significantly from that of patients with no amplification. Among the node-negative 
patients (n=79), high level 20ql3 amplification was a highly significant prognostic factor 
for shorter disease-free survival (p=0.002) t even in multivariate Cox's regression 
5 analysis (p =0.026) after adjustment for tumor size ER, PR grade, ploidy and S-phase 
fraction. 



20q13 amplificatio n in metastatic breast tumors . 

Two of 1 1 metastatic breast tumors had low level and one high level 
10 20ql3 amplification. Thus, the overall prevalence (27%) of increased 20ql3 copy 

number in metastatic tumors was a similar to that observed in the primary tumors. Both 
a primary and a metastatic tumor specimens were available from one of the patients. 
This 29-year old patient developed a pectoral muscle infiltrating metastasis eight months 
after total mastectomy. The patient did not receive adjuvant or radiation therapy after 
15 mastectomy. The majority of tumor cells in the primary tumor showed a low level 
amplification, although individual tumor cells (less than 5% of total) contained 8-20 
copies per cell by FISH. In contrast, all tumor cells from metastasis showed high level 
20qI3 amplification (12-50 copies per cell). The absolute copy number of the reference 
probe remained the same suggesting that high level amplification was not a result of an 
20 increased degree of aneuploidy. 

Diagnostic an d Prognostic Value of the 20ol3 Amplification. 

The present findings suggest that the newly-discovered 20ql3 amplification 
may be an important component of the genetic progression pathway of certain breast 

25 carcinomas. Specifically, the foregoing experiments establish that: 1) High-level 20ql3 
amplification, detected in 7% of the tumors, was significantly associated with decreased 
disease-free survival in node-negative breast cancer patients, as well as with indirect 
indicators of high-malignant potential, such as high grade and S-phase fraction. 2) Low- 
level amplification, which was much more common, was also associated with 

50 clinicopaihological features of aggressive tumors, but was not prognostically significant. 

3) The level of amplification of RMC20C001 remains higher than amplification of nearby 
candidate genes and loci indicating that a novel oncogene is located in the vicinity of 
RMC20C001. 
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High-level 20ql3 amplification was defined by the presence of more than 
3-fold higher copy number of the 20ql3 amplification is somewhat lower than the 
amplification frequencies reported for some of the other breast cancer oncogenes, such as 
ERBB2 (17ql2) and Cyclin-D (llql3) (Borg et aL, Oncogene, 6: 137-143 (1991), Van 
5 de Vijver et aL Adv. Cane. Res., 61: 25-56 (1993)). However, similar to what has been 
previously found with these other oncogenes (Swab, et al.. Genes Chrom. Cone,, 1: 181- 
193 (1990), Borg et al. t supra.), high-level 20ql3 amplification was more common in 
tumors with high grade or high S-phase fraction and in cases with poor prognosis. 
Although only a small number of node-negative patients was analyzed, our results 

10 suggest that 20ql3 amplification might have independent role as a prognostic indicator. 
Studies to address this question in large patient materials are warranted. Moreover, 
based on these survival correlations, the currently unknown, putative oncogene amplified 
in this locus may confer an aggressive phenotype. Thus, cloning of this gene is an 
important goal. Based on the association of amplification with highly proliferative 

15 tumors one could hypothesize a role for this gene in the growth regulation of the cell. 

The role of the low-level 20ql3 amplification as a significant event in 
tumor progression appears less clear. Low-level amplification was defined as 1.5-3-fold 
increased average copy number of the 20ql3 probe relative to the p-arm control. In 
addition, these tumors characteristically lacked individual tumor cells with very high 

20 copy numbers, and showed a scattered, not clustered, appearance of the signals. 

Accurate distinction between high and low level 20ql3 amplification can only be reliably 
done by FISH, whereas Southern and slot blot analyses are likely to be able to detect 
only high-level amplification, in which substantial elevation of the average gene copy 
number lakes place. This distinction is important, because only the high amplified 

25 tumors were associated with adverse clinical outcome. Tumors with low-level 20ql3 

amplification appeared to have many clinicopathological features that were in between of 
those found for tumors with no and those with high level amplification. For example, 
the average tumor S-phase fraction was lowest in the non-amplified tumors and highest in 
the highly amplified tumors. One possibility is that low-level amplification precedes the 

30 development of high level amplification. This has been shown to be the case, e.g.* in 
the development of drug resistance-gene amplification in vitro (Stark, Adv. Cane. Res., 
61: 87-113 (1993)). Evidence supporting this hypothesis was found in one of our 
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patients, whose local metastasis contained a much higher level of 20ql3 amplification 
than the primary tumor operated 8 months earlier. 

Finally, our previous paper reported a 1.5 Mb critical region defined by 
RMC20C001 probe and exclusion of candidate genes in breast cancer cell lines and in a 
limited number of primary breast tumors. Results of the present study confirm these 
findings by showing conclusively in a larger set of primary tumors that the critical region 
of amplification is indeed defined by this probe. 

The present data thus suggest that the high-level 20ql3 amplification may 
be a significant step in the progression of certain breast tumors to a more malignant 
phenotype. The clinical and prognostic implications of 20ql3 amplification are striking 
and location of the minimal region of amplification at 20ql3 has now been defined. 

It is understood that the examples and embodiments described herein are 
for illustrative purposes only and that various modifications or changes in light thereof 
will be suggested to persons skilled in the art and are to be included within the spirit and 
purview of this application and scope of the appended claims. All publications, patents, 
and patent applications cited herein are hereby incorporated by reference for all 
purposes. 
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WHAT IS CLAIMED IS : 

1 1. A method of detecting a chromosome abnormality at about 

2 position FLpter 0.825 on human chromosome 20, the method comprising: 

3 contacting a chromosome sample from a patient with a 

4 composition consisting essentially of one or more labeled nucleic acid probes 

5 each of which binds selectively to a target polynucleotide sequence at about 

6 position FLpter 0.825 on human chromosome 20 under conditions in which the 

7 probe forms a stable hybridization complex with the target sequence; and 

8 detecting the hybridization complex. 

1 2. The method of claim l t wherein the probe comprises a 

2 nucleic acid that specifically hybridizes under stringent conditions to a nucleic 

3 acid selected from a group consisting of the nucleic acids listed in Table l t and 

4 Table 2. 

1 3. The method of claim 2, wherein the probe is selected from 

2 a group consisting of the probes listed in Table 1 and Table 2. 

1 4. The method of claim 1, wherein the step of detecting the 

2 hybridization complex comprises determining the copy number of the target 

3 sequence. 

1 5. The method of claim 1, wherein the probe is labeled with 

2 digoxigenin or biotin. 

1 6. The method of claim 1 , wherein the hybridization complex 

2 is detected in interphase nuclei in the sample. 

1 7. The method of claim 1, wherein the chromosome 

2 abnormality is an amplification. 

1 8. The method of claim 1, wherein the probes are labeled 

2 with digoxigenin or biotin. 
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1 9. The method of claim 1 , wherein the step of detecting the 

2 hybridization complexes is carried out by detecting a fluorescent label. 

1 10. The method of claim 9, wherein the fluorescent label is 

2 FITC. 

1 11. The method of claim 1, further comprising contacting the 

2 sample with a reference probe which binds selectively to a chromosome 20 

3 centromere. 

1 12. A composition comprising a labeled nucleic acid probe 

2 which binds selectively to a target polynucleotide sequence at about FLpter 

3 0.825 on human chromosome 20. 

1 13. The composition of claim 12, wherein the probe is 

2 labelled with digoxigenin or biotin. 

1 14. The composition of claim 1, wherein the probe comprises 

2 a nucleic acid that hybridizes under stringent conditions to one or more nucleic 

3 acids selected from the group consisting of the nucleic acids listed in Table 1 

4 and Table 2. 

1 15. The composition of claim 14, wherein the probe comprises 

2 one or more nucleic acids selected from the group consisting of the nucleic 

3 acids listed in Table 1 and Table 2. 

1. 16. A kit for the detection of a chromosomal abnormality at 

. 2 about position FLpter 0.825 on human chromosome 20, the kit comprising a 

3 compartment which contains a labeled nucleic acid probe which binds 

4 selectively to a target polynucleotide sequence at about FLpter 0.825 on human 

5 chromosome 20. 
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1 17. The kit of claim 16, wherein label is selected from the 

2 group consisting of digoxigenin and biotin. 

1 18. The composition of claim 1, wherein the probe is one or 

2 more probes selected from the group consisting of the nucleic acids of Table 1, 

3 and Table 2. 

1 19. The kit of claim 16, further comprising a reference probe 

2 specific to a sequence in the centromere of chromosome 20. 

1 20. The kit of claim 16, further comprising Texas red avidin 

2 and biotin-labeled goat anti-avidin antibodies. 
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